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ABSTRACT 

* ^ ■ * . ' * * • - 

• In 1970 the Institute of Library Research converted to machine- 
rea4^ble form over 1 million catalog card records representing about ^ 
750,000;unique Roraan-language titles cataloged by UC libraries during - 
the .period '1963-1967, and .from' this data base printed the UC Union 
Catalog Supplemen^fc (UCUCS-jt) . ' * Some. 1, 7 million additional Roman- 
language card records representing monographs, cataloged by UC libraries 
between 1968-1972 (here called UGUCfS- 2) have been collected and 
manually pre-processed by ILR* Equivalent machine language .records 
now need to be obtained for this new gro\ip of records. This* study 
* determined th^/ei^tent to wliich'"uqUCS-2 records alfe available in already 
existing UC data bases* and in a few *'outside" data bases. 

^ A 1/2% stratified sample was dfrawn^frora each UC camyus, totalling 
8,337 records. ' (Records at UC .Sa.ntli Cruz are already in machine- 
readable form and so were, not considered in this sample..) Of these, 
about 48% could be^ found, in one of the machine, files already av^ailable 
at, and used by, the UC system: about :2!7% were' available in tC MARC • 
files., about 9% in the University of Call,fornia Santa Cruz (UCSC) 
files, and about 12% could b% found in the UCUCS-l files. Of those 
not' found in any of the 3 UC files, about 30% (an estimated 217,000'^ 
records) could be located in the OCLC data base. These findings in- 
dicate that of the total 1,697,822 .UCUCS- 2 Roman- language records,- 
at least 807,000 records could be copied, from existing machine files 
\( these figures being adjusted to allow for 9% o^ the source records 
found to be out of the -defined sample scope — e.g., non-monographic 
materiats). 

The sample was also analysed by language, by imprint dajte, and ^ 
by availability of a unique identification number, such as ISBN or ^ 
LC Card Number. 8fl% of the UCUCS-2 sample could be associated with 
some kind of unique identification number, but only 63% of the total, 
or about J. 06 million records, actually had this number on the catalog 
record, the others having to 'be searched in other files. 
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I. INTRODUCTION ' 



. Thd Institute of Library Research (ILR) hao already converted to 
macMne readable form over 1 million catalog car:d recordOepresenting 
about. 750,000 unique Rbman-language monographic titles cataloged by UC 
libraries during the period 1963-67, From this computer data base, the 
47-volumeUC Union Catalog Supplement (here referred to as UCUCS-1) was 
produced and is now in use in each of «the nine UC campuses, the California 
State library/ and each of the 19 California State University and Colleges 
System ckm^uses, 

^ " ■ ^ . I . • ■ . , . . 

Some 1.7 million additional Roman-language card records, representing 

monographs cataloged by the UC libraries during the period 1968-72 have also 
been collected and pre-processed By ILR, and the cards aire .now warehoused in 
the Richmond storage facility awaiting further planning and processing. 
These records are referred to as UCUCS-2 records. (All of the UC Santa Cruz 
records are already on computer tape and wili be available wtien appropriate 
for further UCUCS-2 processing.) 

There seems to be agreement within the UC Library System that the 

^ UCUCS-2 records -now warehoused in Richmpnd should be converted into machine 
readable form. The qu^lElon is how to convert these records as economically 
and easily as possible. .Sinte no firm commitment has yet been made to any 
specif ic product or service (e.^., printed or microform author-title book 
catalog, or on-line searching), this proposed machine file ought to have 
^ the flexibility to generate a wide, variety of products or services such as 

. catalog cards, circulation records, a systemwide 'shelf list, and printed 
book -catalogs or supplements. 



Some of the tard recordo are already available' in laachine form frpm 
external source files cuch aa the LC tIARC, OCLC, UClJCSt-l, and UC Santa Cru2 
machine filea, an^ could very likely be copied therefrom with ^iess time and 
cost than required for original converoion. Some records are not available 
in mchine form and would have to be converted,^ 

Ken Weeks, who headed the ^project at its start, was responsible for 
most of the data collection effort for this study, and v/as later assisted 
by other graduate students of the UCB School of LibTarianship^^ including 
Hancy Chriotenson, Ron Heckart, Ned Himael, Jane^Irby, and Bob Treppa,. o 



.II. OBJECTIVES 

The Hiajor objectlveo of thia study were to: 

1. Determine the magnitude of the UCUCS^ conversion problem 

2. Determine the extent to- which UCUCS-2 records were already 

available on some ^xi^ting data bases ' * ' 

Additional lesser objectives were to: / . 

t 

1. Determine the extent to which the UCUCS-2 records overlapped 
with several existing machine bibliographic files 
• 2* Determine the nature of the library materials represented by 
the UCUCS-2 catalog records (i.e., the nature of the material 
acquired and cataloged by the UC campuses during the period 
1967 through 1972) 

3. Determine the extent to which some unique record identification 
'number such as the LC Card Number was available for each, of the 
UCUCS-2 records. 



\ 



3 



III. ^lETHOD OF APPROACH 

A. DEVELOPMENT- OF UCUCS-2 SAMPLE 

All of the UCUCS-2 records from the UC Santl^^Cruz campus library already 
exist in machine readable form as a byproduct of that library's continuing / 
local book catalog production efforts, and do not pose any significant con- 
version ptoblem iot the subsequent UC union catalog or union file efforts. 

Consequiwtly* this study was really interested only in the catalog records 

^ - it 

from the other, eight canjpuses. ^ 

In order to provide an^accurarte and meaningful basis for our analysis 
efforts, a stratified random sample was drawn cfrom the UCUCS-2 cards sub-* 
mitted by each of the campuses other tl\gn, Santa Cruz. Because ^^11 of the/ 
UCUCS-2 cards which had previously been sent ^o ILR had gone th/rough* some 
.preliminary processing ('e.g*, re-packaging, numbering, microfilming) we had 
relatively good estijnates of the total number of catalog cards submitted by 
each cainpuo. This helped the sampling plan considerably. Because we had a 
good estimate of the total number of cards from each caihpus, we could then 
easily specify a sample size td be drawn from each campus' UCUCS-'2 records. 
And because all of the cards had been stamped with a unique number by a 
numbering machine it was relatively easy to use a random number table to 
draw numbered records from each campus' records in order ,to meet the. speci- 
fied sample sizek The selected records were extracted from their containers 
^nd copied onto a template such as that shown in Figure 1. A total of 8,337 
records were copied in this manner to be uhed ae the source data for this 
^tudy. 
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Ridley, Nicholas, Bp. of London, 1500?-1555.. ^ 
The works of Nicholas Ridley. C19d83 



(Card 2) 



■ -1. Chtirch of Eagland - Collected works . 
2. Theology - Collected works - 16th cent. 
I. Christmas, Henry, l8ll'-l868, ed. 
(Series) 



I! 

J 



LC llARC 
^JCSC-.73 
. UCUCS-1 
OCLC 



YES 



NO 



D 

UU2 

L3- 

1959 



Langsam, Walter CD^suelo , 1903- 'ed. 

Documents and readings in the history of 
Europe since iQlQ cbya Walter Consuelo Langsam, 
withJ the assistance of James titchael Egan 
ri.e. Eaganj/Rev. and enl. Philadelphia, 
■ Lippincotc^ 1951.^ New York, Kraus Reprint Co 

XXV, 1X9OP. 24cin. . \ 
/ — . ■ 
• 1* Europe - Hist. - I9l8-19^. - Sources. , 
I. JBagan,, James l^ichael, jt. author. II. Title. 
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IS?** 



■ - • - ■ -Fffo.M cm 

?arker, John Henry, 1806-188U;- •. 

An introduction to the Study of Gothi-9 ar- 
chitecture. Ifth ed.., rev. and enl. Oxford 
& London, J, H. and J. Parker, I87V. 

XX, 3'29p. lllus. 17cm. 



1. Arch icec tare » Gothic. 




Figure 1. 
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B, SEARCH FOR OVERLAP IN EXISTING MACail^ I-flLES . • 

1. Search In UC Files ' — - ■ ■ ' 

Xhere are three existing. UC machine files that fan help with' the yCUCS-2 
conversion^ effort • ^ AIL of these data bases aye inmiediateiy accessible to 
the UC University-wide Library Automation Prqgram (ULAP) with nb extra acqui- 
sition or royalty costs: the fudl retrospective LC I4ARC data base is alre*a4y 

■ ' ' . ■ . . # 

stored at the Data Processing Center in Berkeley and used on a regular pro- 
ductlon basis by Bibcent^r; the UCSC data base can be obtained readily from 

UCSC when needed; and the UCUCS-1 da.ta base presently resides at ULA? and . 

^ ' *» ■• • 

plans are being made to prepare it for. regular 'Bibcenter production searching. 

It was assumed that in the dvent that a record inight be available fi?6m 
more than one machine* file, there would be a preference for one file over 
the other©. It was agreed that the most desirable file (because of its high 
quality *and'detait o£ fagging of bibliographic eljements aftd i£s ready avail- 
. ability a^: low cost to UQ) would/be the LC MAftC'.file, This is clearly thfe . 
first choice among the available « files* The next choice (because of its - 
relatively clean and complete MARC format records) is the UC Santa Cruz^^' , . 
machine data base covering the entire UCSC library Holdings. Tl\e third ^ 
chdice would be the UCUCS-1 data base, with its lesser quality and level 
of tagging^ ^ . ' 

Because of tl^ls preferential selection policy, most of • our searching 
of the sample records was done on a sequential basis, following' thfe stated 
preferences^ A record was searched initially to see, if it was in. the LC 
MARC data base; if it- was, the search terminated at that point. If the 
iTecord was not in the LC data base, it was then searched in the UCSC. ^ 

d^ta base, and so on. A few of the sample records were searchied in all of 
the data bas^s during an ea^ly part of this study in order to estimate the 
relative overlap in the several * machine data bases. ^ , ' . 



The search process asually followed the 'sequence illustrated in Figures o ' 

2. . Th^ f irst step was to examine the sample^ecbjrd to ^ee if/ it was ' really ^"Vi 

a ^ catalog record for monographic material^; The original UCUCS effort was 

; " .V • ...^ r ^ . • 

intended by policy. to include only monograph materials, However, a sig- 

*. ■ ■ ■ 

nificant amount of non-motidgraph material actually was submitted by the 

pdmpuses and included in the printed book catalog. T?he exact: scope of % . 

the UCUCfa-Z conversion effort has not beeii' established yfeti However, for . 

the purpose of this" study we^made the simplifying assumption that only 

monograph records would be included. With this as&uini^Uioa we then pul]e(l. >^ 

from the ' sample all non-monograph catalog records as well as all other . 

records that h&d slipped^ past thp initial UCUCS-2 pre-processing activities -V 

(e.g.; catalog records with nonrRoman characters, notes ^ and order slfps). 

, . \' • 

This excluded material .turne^ out to be a significant fraction of the sample. 

The- remaining records were immediately examined to see if by inspection 
;they coul4'^be positively identified asvbeing in thd LC MAHC data This 
is indicated by the notation "LC MARC" or "MARC" on the card. . These cards 
were immediately set g^ide Irl a separate pile. , ' 

ThjB remaining cards' yere then examined to see if they were 'probable . 



MARC** cardis. Bedausfe of the way in which retrospective and. continuing con- 
. version efforts afce^'being made at the Library of Cortgress (LC) there is no 
publication date at which one can absolutely rule out the possibility of 
an LC MARC recp.?d; Howeve'r, there is some frequency distribution informa- 
tion'for ptiblication dates that can provide some- useful guidance,.^ Table 1, 
for ex^ple, obtained from a recent unpublished analysis of the /complete LC 
MARC file, shows: the number' q% LC MA&C records as of mid-i975vthat had the.* 



indicated publication dates. Few records are available with publication 
'^ates before 1960, ;and LC was generating very few foreign language catalog 



PUBLIOATION 
DAfE 

NO DME 

1960 
1961 
1962 
1963 
1964 
1965 
1966 
1967 
, 1968 
1969 
1970 
1971 
1972 
1973 
1974 
1975 
1976 
• 1979 



NUMBER OF RECORDS 
WITH THIS PATE 

146 

\ 16,183 
;2,376 • 
2,321 
3,067 
3,852 
5,093 
6,922 
19,170 

36,235 ^ 
60,652 
61,382 
63,960 
62,378 
64,561 
66,205^ 
49,900 
19,358 
435 

J. 
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TABLE 1 

Distribution of Publication Dates in Full LC MARC Data Base 

as of July 1975 



* ■ • 

records for the publication dates covered in tha UCUCSr2 material* For 
these reasons y we mafte a simplifying assumption and judged any record at 
this point that w^^^ a foreign language and had an imprint of 195*9 or 
earlier to be- a non-LC MARC record; all of the remaining records at this 
point x^re^till "probable MARC" records, - 

%f such a "probable tlARC" record did not have an LG card number, as 
subsequt^ntly determined by a search in the National Union Catalog publica- 
tipnSy it was assumed' not to be in the LC MARC data baseband was then 

transferred to the non-MARC pile. > ^ . . 

*• . ■ , . 

^ , " , ■ ■ ^' 

There were several ways in which the "probable MARC" records could 

be resolved. The way we chose was to examine a printed LC Card NuDjber 

index to the full LC MARC data base that was prepared for the UC Bibcenter 

file. The lookups were speeded up by first manually sorting, the samjple 

records into LC Card Number order. Records that were found in this 8earct\ 

were then annotated with the number and transferred to the LC MARiC record 

pile.: ^Because the printed index was prepated by the UC Bibcenter several 

.4 

V ■ ^ ' . ■ 

months p,rior to the time at which we were doing our searching, there was 
the possibility that some of the -"probable MARC" records might have been ' 

0 

added to the LC MARC data base by LC after our index was printed. In ^ 
order to bring our search up to date and cover absolutely everything that 
was in the LC MARC data base, we then searched' some of the remaining 
"probable MARC" records on-line against the most recent LC MARC records 
in the Stanford BALLOTS system. A total of nine LC MARC records were 
found here as the result of 215 seairches. The remaixiihg 894 "probable 
MARC" records were i4:hen searched against the latest copy of the MCRS 
microfilm index of the LC MARC data base that is distributed weekly and 
provides the most up-to-date index^ to the full LC MARC data base. A 



ll^iaall number of additional MARC records were found thib way ^and resolved 
all of the "probable MARC'^ records. V 

i» • - 

All of the rion-MARC records were then given a grosis alphabetic sort by 

the first two or three characters of the author or title entry in order to 

— ■ ' ^ 

speed up the manual lookups in the printed 1973 UCSC Uuthor-title book 
catalog. Matching records for the UCSC data baae were set aside in a-'sep- 
arate pile. * ^ 

s 

The -remaining sample records that were not found Un the LC MARC or UC5C 
catalogs were then searched in the printed UCUCS eluthor-tltle book* catalog 
and split into piles for matches and non-iaatches. This compl^eted the search 
process in the UC machine files. • • \ /\ 

It would be a relatively straightforward task to ^featch the remaining • 

r 

non-matching (i.e'. , "residue") material in any of sevetal other available 
machine data base^, and this in fact was done with the OCLC data base. 

In all of the searches a match was recorded when ther^ was either an 

* ♦ • 

exact match or a "near" match. A near match was defined as a situation in 
which the catalog records were identical except for < few minor changes 
such as edition, date, or place of ptiblication. In one of our initial 
tests we found that broadening the iverlap to include near matches in- 
creased the total overlap figures by about five percent. 



2. Search in Ohio Collefte.v.Jblbrary 



Center (OCLC) File 



Arrangements were made with OCLC to permit a terminal at ILR to be 



used to search the OCLC data base o 



t-Xine. This was .done ov6r the Tymnet 



data communications network and was ^restricted by the T^pnet line to a, — 
transmission rate of 30 characters p^r second. A Texas Instruments Silent 
700 thermal printer terminal was used at this 30 character per second rate 
to do the searching and to obtain a printed record of all of the search 

11 ^ . 



tranaactlons. Most of the searches were done by ILR staff members with 

library experience, although a few searches were made by staff members with 

, less experience. All of the searching was done during the months of January 
^ » * 

„ ' to July 1975. 

' It was known*. i^ advance that vOCLC searching by LC Card Number (LCCN) 

r: .... I , . _ 

was* considerably faster than searching by , title or autf|gi?- title -seairch keys. 
For that reason, LCCN searching wak done first for any r^feord that had such 
a number qn the card. Cards th^t did not get a hit withl* thiioLCCN search or 
/ did not have an LCCN were given a^ title or author-title seai/el^i Because 
the' title search w«s generally faster than an author-title seatifeh and 
.> yielded fewer erroneous citations lit response to the search codeA it was 
the preferred form of search for the cards that did not get a hit on LCCN 
Searching, Cards that d5;d not get a hit on a title search were alst) given 
an author- title search. . , 

The final results of ,th\2 search effo'rt were then annotated on each of 
the sample source records. ■ ^ , 



el 
th 



<3 ^ 



• . IV. FINDINGS 

^. TOTAL SOURCE RECORDS TO BE COHVERTED • * . 

trCUCS-2 cards were received .in 1973 from all campiises except Santa Cruz, 
noted earlier, all o£^ the data and the converDion problem relate to the 
ht non-UC Santa Cruz campuses. No cards were requested or received from 

more than 100 UC libraries that are not. affiliated with the. university 

■'■ i ' ■ . . ' . " ' . " 

library systems of each campuff. • 

(Under the direction of Tom Hi^groye of 'iLR* all of the cards received 
by ILR were given some pre-procea^;lng which included re-pa&kaging into • 
•standard size boxes, serial num4ritig of each card, microfilming of each 
card, the separation and separate boxing of records that contained noh->Roman 
character! and the exclusion* of some records (e.g», notes and non-monograph t 
catalog rehords)' that were not meant for conversion. As a result of this 
pre-procesLing effort, some fairly accurate data was available from Tom 
Hargrove regarding the gross counts of the source records. 

Becaust of the special and difficult problems associated with the 
computer representation and processing of non-Roman alphabetic information, 
mbst of thi records submitted for UCUCS-1 that contained non-Roman-alphabet 
informationlwere set aside for special processing and not included in UCUCS-1. 
These approximately 63,000 records have still not received any" processing. 
It is assumekthat the same practice will be followed for UCUCS-2, namely 
that the non-Roman material will be separated out and not converted with 
\5the other material, Thus only the Roman-alphabet'material is of interest 
to the conversion problem. It should be clear, however ,. that some con- . 
sideration eventually needs to be given to^the approximately 193,000 non- ' 
Roman records (63,78A for UCUCS-1, 130,099 for UCUCS-2) that have been 



'subtaitted to date for the UC union catalog effor1:s» 

Because ,th'e 'non-'Eomait records ha^e already been set aside for separate 
processing, this study concerned itself only with planning for the conver- 
sion of the Roman-alphabet records. According to the data in Table 2, this 
leaves^s with 'the problem of handling almost 1.7 million catalog cards as 
inpii# to the qonversian effort. Our problem is described in terms of cards 
rather tfe^ti titles bjacause all of the programming, numbering, linkage of 
contfnuatioa cards, and control of card conversion is done by a locally 
assigned card number gather than title. VJe will in fact be faced with a 
total number. o£ tlilSts wMch is smaller than this card count. 
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iroMBER OF CARDS RECEIVED 

1 « SPECIAL 

eAMPUS . ROMAN TiiON-ROMAN^ ^- MATERIAL^ TOTAL ' 

■ ^ « , . ■ ■> \ . . . 

Santa Cruz ° 88,000 * - " 88,000 

' (tape records) ' " (tape records) 



Lo3 Angeles 


341,472 


33,734 Y . 

' . It 


13,790' 


388,^96 


Santfa Barbara 


283,632 


•20,742 '\ 


16,284 


320,658 


Berkeley 


266,696 


,53,373 


325 . 


320,394 


Davis 


228,762 


8,407 


25 


237,194 


San Diego 


209,921, 


3,948 


24,900 


238,769 


Riverside 


198,987 


6,361 


183 


205,531 


Iryine 


141,384 

* 


3,036 


2,475 


• ' ./146,895 


San Francisco 


26,968 


498 


525 


; \>27,991 


TOTAL CARDS: 


1,697,822 


130,099 


58,507 


1,886,428 


TOTAL FILE: 


1,785,822 


130,099 


58,507 


1,974,428 



NOTES ' ' 

1. ROMAN—cards in all Roman-alphabet script, includ|Lng diactltics. 

2e_ N0N-^R0MAII— cards containing some NON-ROMAN script, even if text cataldged 
Is in Roman script, either as transliteration or ttanslation into a Roman- 
s8ript language. / ■ ■ ^ ' i 

' 3* SPECIAL — material requiring special handling due to non-regular card stock, 
special notation, illegibility, special cataloging— brief, photo-listing* 

' • • • 

TABLE 2 

Materials Received for UCUCS-2 > - 



B. . NATURE. OF THE SOURCE RECORDS TO BE COHVERTED ' . ' 

^ " ■ ' ' ■ ■ ' ' " ■■■ i ■ I ^1 ■ ' ■ ■ . ' ■ 

■ ' . ■ ' ■ ' ■ ' r 

The UCUCS*-2 records represent most of the monographs cataloged by the 
UC General Libraries during the period 1968 through 1972. ' As such, this 
sample represents much of the -type of material that was going into the UC 
collections during this recent 5-year period* . For that reason it is of 
interest to determine the general^ characteristics of this material. AH 
of the data in this section is based on a study of. our entire sample of 
source records. ■ - ' ."^ ' 

1. -. Language . ' ^' . 

♦ ■ 

An analysis of the language of publication of our sample monograph 
records shows that approximately 345^ of this material was in 4 foreign 
language.^ Berkeley had the highest percentage (44%) of foreign language 
monographs. Detailed data is given in Table 3. 

2. Imprint Date 

> The distribution of imprint dates of monograph material in the sample ' 
is given in Table 4 and illustrated in Figure 3. Almost half of the mate- 
tial was published during the 5-year period in which the UCUCS-2 material 
was collected, and about two-thirds of the UCUCS-2 material was published 
during the total UCUCS time period of 1963-72. About one quarter of the 
UCUCS-2 records were for material more than 20 years old* 

3. Availability of Unique Identification Number ? 
Suggestions have been -made of the possibility of preparing a computet- 

based numeric register or book catalog for the UC library resources in^^%^ 
manner similar to -that done recently for the Louisiana libraries.* Such' 
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* McGrath, William E. and Donald Simon, "Regional Numerical Union Catalog" 
on Computer Output Microfiche," Journal of Library Automation 5:4 (pecem-* 
ber 1972) 217-229. See also McGrath, William E. , "LNR: Numerical' Register 
of< Books in Louisiana." LLA Bulletin (Louisiana Library Association) 34:3 • 
^ (Fall 1973j) 79-86. ^ 

16 . . . . : . 



ENGLISH 
RECORDS 



NON-ENGLISH 
RECORfiS 



CAMPUS 


SIZE 


Berkeley 


- 1,134 


Santa Barbara 


1,453 


Riverside 


1^.235 


LoQ Angeles 


l,4;bl 


,Davi3 t 


M2 


Irvine 


666 


San Diego • ^ 


. 1,103 


Satt Francisco 


, ;95 


TOTAL ' 


8,029° 







Number 


Percent 






494 


43.56 




61 18 


564 


38.82 






438 


35.47 


952 


67.47 






656 


70.39 


276 


29.61 


487 


73.12 


^ 179 


26.88 


830 


75.25 . 


273 


. 24.75 


78 


f t 

82.11 » 


17 


17.89 


5,329 


66.37 


2,700 


■33.63 



TABLE 3 

Distribution of English and Non-English UCUCS-2 Monograph Records 
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a finding to& woul^ consist simply. of an LC Card Numbel: or ISBN, or ISSN, 
adong with a location codevfor\he holding library, and would be relatively 
inexpensive to produce. Obviouslji^ it could only be used as a limljted 
finding tool and would .perhaps, be of some interest ^o library selection 
staffs in the- context of cooperative collection development and utilizajtton. 
If such a numeric register weri^ to be considered, some information would be 
nec^sary to determine what fraction of the holdings liad, some kind of unique 
ldenlfc4,fipation number that-could be usedT. as the access point in this type of 

catalog. \ • 

- - , * ** ' '■ " ^ • ■ " 

Some the existing UC catalog records contain autfh a.. national or inter* 

national number on .the local catalog re^^prd* Some of th^ records are 

elated Kith numbers that can ^bnly, bp found by searching; some oth^r catfklgg 

st^ch as the National Union Catalog (NUC); this of ten^flippens, for e?ea#jple, 

when a catalog card is prepared lacaUy in a4van(2e;of cataloging by LC^ and 

not rep^ced or augmented subsequiently by the' IC information. , 

4 The data in Table 5 indicates^ that aboa£>j81% of the UCUCS-2 material 

had some sort of unique numbfer associated with it ,^ tut a Idrge fraction of 

■the n^umbers were not on th^ local *^cards and would have to bi found by a 

»Lrelatlvely time-constlmlng lookup process. About 63% of the^ total UCUCS-2^ 

' ■ ■ %■ 

. ■■•■»."■'■ . « ■ 

records had some type of ui^ique material identification numbe^^n ^the l^ocal 

catalog record. All of the records that had an ISBN ajso had an LC Card 

Numbei^. No ISSN entries were found even though it was theoretically possi^ 

ble to have an ISSN and LCCN on the same work; 

'Psing the best available -Information regarding the total ftumber pf 

Roman-alphabet UCUCS-2 records, we see f r^ the /(Jatd in Table 6 that if a . 

numeric register were to be constructed, it coul-a represent about 1.06 

millibn of thesd cards If the information was taken directly^ from the 
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NUMBER OF ROMAN-ALPHABET CARD RECORDS 
RECEIVED FOR UCUCS-2 



CAMPUS 

Los Angeles 

Santa Barbara 

Berkeley - 

Davis 

SaiyDlego 

Riverside 

Irvine 

San Francisco 
TOTAL 



Total 
Records 

341,472 

f,632 
,696 
228,762 
209,921 
198,987 
141,384 
26,968 
1,697,822 



Total With 
Number 
On Card 

174,492 

207,902 

•159,698 

154,872 

141,403 

107,154 

99,138 

17,316 

1,061,975 , 



Total With » 
Number Found 
After Lookup 

. 78,402 

^38,858 

45,872 

40,491 " 

41,669 

44, '314 

23,993 

2,840 

-316,439 » 



Total With 
Number 

252,894 

246,760 

• 205,570 

195,363 

183,072 • 

15J,468 

123,131 

. 20.156' 

1,378,414 



, TABLE 6' 

Number of Records with a Unique National or International Identification Number 



22 



available cards, and a total of about 1.. 38 million of these records if 

■ . ■ ' . ■ , <f • ' 

; additional lookups were to" be made to search for missing numbers • 
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C. AVAILABILITY OF SOURCE RECORDS iN EXISTING MACHINE FILES 
1. Availability Ih^theUC Files (LC MARC, UCSC« UCUCS-1) 

Because It Is generally so much more expensive to create a new machine 
record than to copy It from some of the existing data bases, of ''catalog 

records, .one major file conversion policy would be to make as much use as 

' ^ ' , . ■ ■ ■■ 

^possible of available machine data bases. Project and budget planning for 

UCUCS-2 record conversions will nead a good jS^jtidUte of the number of records 

■ • , • / ■ ■■• . ■ ■ . 

thilt might be copied, as well as the nu^al^er of records that might have to go 

■/ ' • . ft. 

through an original conversion process. ' # 

Using. the search procedures and sequehce described in the earlier section 
on our method of approach, the records for each campus were examined to find 
the extent to which they overlapped N^h three existing UC machine data bases: 
LC MARC, UCSC, and UCUCiS-1. Thg results of this analysis of 8,337 sample 
records are summarized in Tabl$ 7* > 

As shown by the data in Tables 7 and 8, about 27% of the UCUCS-2 material 
can presently be taken from the LC MARC data base. .Over 15% of the source' 
cards will be immediately identified as LC MARC r^rSs by inspection of th^ 
card, and another 11% of the source c^ards could only be identified as LC MAl^C 
records by a bibliographi^c author-title search in the National Union Catalog 



(1956-68 cumulation, and 1968-^72 cumulatior 

A total of over 48% of the UCUCS-^2 niaterial can be taken from at least 
one of these three UC files. Some of the remainder can be taken from other 
files such as OCLC 6r .SDC/Information Dynamics LIB^OnI 

5 1 

For some oj^ the campus samples, the records were searched against each 
of^he three UC data bases to find the extent to which Mch of these data - 
bases contributed unique records. These searches indicatted that each^of * 
the three data bases contribute some records that were unique to that data 
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ITEMS IDENTIFIABLE ADDITlONAI. ITEMS FOUND 

BY INSPECTION BY BIBLIOGRAPHIC SEARCHING' 

TOTM, AS MARC REC ORDS TO BE fiARC RECORDS TOTAL MARC RECORDS 

• SAMPLE " 



CAMPUS 


SIZE 


Number 


Percent 


Number 


Percent 


Number 


Percent 


I - 


704 


166 


''23.58 


101 


14.35 . 


267' 


37.93 


= SD 


X,X56 


150 


12.98 




-M7.04 


347 


30.02 


D 


1,015 . 


187 J 


■ 18.42 


120 


iiM : 


307' 


• 30.25 


SB 


1,587 


3Q8 


19.41 


• . ' 141 


8.88 


449 


28.29 


SF 


.97 


13 


13,40* 


13 . 


13,40 


26 


26.80 




1,423 


. 227 


15.95 


134 




361 


25.37 


B • 


1,238 


187 . 


15.11 


115 


9.29 


w 


24.39 


LA 


1,617 ; 


154 


9.52 


"196 


12. U 


350 


21.65 


TOTAL 


8,837 


1,392 




1,017 




2,409 




COMPOSITE % 




15.75 




11.51 . 




; 27.26' 



TABLE 9 . i 
UCUCS-2 Monograph Records Identified aii LC MARC Records 



>aae» This data base overlap information io illustrated in Figures 4, 5, 
and 6. ' ^ 

The initial estimates given in Tables 2 and 7 for the $ize of the con- 

version problem can be modified slightly as a result of this detailed record- 

■ , ■ ■ . » "* • 

by-record examination of the 8,337 sample records^ This detailed examination 
found that a small fraction of the numbered and counted UCUCS'-2 material can 

be excluded from further processing. The UCUCS-2 Roman*-alphabet source 

^ '-,.*. 
records, Sotalling^ 1,697,822 cards in Table 2 were assumed to be all iaono« 

4,' 

graph catalog records ready for conversion. However,>' the detailed review 
of our sample indicates, as shown by tfie data itt/lfable 7, that, a fraction 
of these records are actually for serial records or other material that was 
defined to be out of scope foi; UCUCS-1 .material, and assumed to be out- of 
scope for this study. The 'UCUCS-2 pre-processing operation removed much of 
this material, but some of it still slipped throught When this out-of-scope 
material, estimated* to be about 9 percent of the tbtal Roman-alphabet card 
records, is subtracted from the total received UCUCS-2 records, we have a 
smaller total number of records for conversion than our earlier estimate 
given i^ Table 2. This total-still assumes phat none of the card records 
will be manually consolidated l^^fore conversion. 

With this minor adjustment, as shown in Table 7, we now see a total 
of at least 807,000 UCUCS^2 card records that can be copied from existing 
UC machirie files* • 
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24.2% 
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I LC MARC (30.3% TOTAL) 
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I (9.6% TOTAL) 



I J 



ERIC 



Figure 4. UC File Overlap for Riverside lICUCS-2 Records 
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i3.5% 



' LC MARC (36.2% TOTAL) 




UCSC (6.8% TOTAL) 



.1 



UCUCS-l (11.9% 



ERIC 



Figure 5. UC File Overlap for Davis UCUCS-2 Records 
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I LC MARC (3^.3% TOTAL)* 



I 



28.0% 



1. 



2.7% 



1.0% 
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i+_ — __ 
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I 
I 

I 
I 



14.0% 



6.7% 1 3.7% 



I 
I 

I 



. 4.3% 



.UCSC (l5.3% TOTAL) 

^ - ■ 



t_ 



UCUCS-1 (21.0% TOTAL) 
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Figure 6* UC Hl« Overlap for Irvine VCUGS-2 Records 
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Z, Availability In the OCLe File 

^ . . . • • . 

The Ohio College^ Library Center (OCLC) preoently hao over one million 

• • • ' o ■ ■ 

• ■ % ■ ♦ . ^ " ■• , 

catalog records available for on-line eomputer GoarcMng and copying. 
Approx^telyAoO,000 of these recorda duplicate the LC MARC file that UC 
already maintains*! . ^ 

All of the UCUCS-2 reqotds that were not found in any ''of the three UC 
files were searched against OCLC. - « 

In summary, as shown by the data in Table 9, a total of 30% of the- 
residue records (i.e., those records not, found- in any of the three UC data 

.. ■ ■ ^ J - 

ba^es) were found in^he'JOCLC data base. Extrapolating this to the total 
UCUCS~2 card file. We find that after first searching against the three ^ 
available UC f ilea?, ah additional 217,000 records could be foi;nd in the^ 
OCLC file. 



,31 

38 



GAMens 

Log Angeles 

Santa Barbara 

Davis 

Berkeley 

San Diego 

Riverside 

Irvine 

San Francisco 
TOTAL 



Total 
Number in 
Residue 
Sample 

883 

676 

404 

705 ; 

4b3 

^488 

223 

42 



3,774 



Ntmber of 
Residue 
Sample 
in OCLC 



Percent 
of Residue 
Sample 
in OCLC 



Estimated 

Total 

Number 

of - Residue 

Records 




Estimated ^ 

Number 

of Residue 

Records 

in OCLC 



175,926 


41,606 


120,827 


39,858 


91,047 


35,833 


• 151,*884 |. 


29 ,300 


,7S,200 i 


26,337 


68,253 


26,014 


44,776 ' - 


/• 15,662 




2.779 


^^7^590 


'217,389 



1. The residue is defined as those records that coulfl not be found in any of 
the 3 UC files (LC MARC, UCUCS-1, UCSC). ™ 



TABLE 9 



Number of Residue Records Included in OCLC Data Bdse 
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3. Availability in Other Files 



. Several other data bapeo eKist that should be oxamlned to determine 
the .extent to which they ^ould contribute additional r/aeords to assist the 
UGUCS-2 record conversion effort. Files that already exist and merit addl- 
tional examlnati<3n are: ' - • ' \ / • 

. • ^DC/Inf^rmatlon Dynamics -LIBGON ( 1.2* pillion index entries , but a 
. ssu^U ^nUB&er of non-LG MARC records) * 

" BALLOTS (for the several thousand locally-generated records) 

' ■ ^ " ■ 

• • Blaekwell North America Inc. ' 

. Auto-Gjraphics - 1 ^ 

;. New York Public Library V 
Ir^ ^General Research Corp. ' ' 

. Information Design \, . ' ^ 

Several other files that may be prepared in the near future, should alsrf be 
examined, including: ^ ' ' . 

• California State Library (planned conversion of union, catalog) 
. CSUC shelf list conversion. 
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RECENT ILR PUBLtCAXrOHS 



Publications of pa]p*t» «ad tepotts of^itttecir»t co scholars and practitiontrs In fehs 
field of libtaty and iafontatiaa tcitace is au iapotcant fuacfcion of cha Intcitutt of 
library ttstatch. In additioo to this seudyt cha following hava bta^i publishad cacently 

by mi < ; • • 

Todd» ludyi Smaaary Report of Studant Studies of tha Sublact Haadings Usad 
' in tha Uni yraity of Califo rnia, 8atfcfclcy> Subject Cacaloy (July 1973) 
n S pp. (ERIC No. EJ)-082 775) .; ' 

»■ < 

ltR-73*002 Bourne, Charles P.« and Ja Robinson, SPI Cieatlan Cheeklrtg as a Measure of the 
Performance of library Docuaient Pelivery S yacecs (July 1973) 10 pp» 
(ERIC 5(0, £0-082 7^74) -^^^ - ' 

11.11-73-003 Weeks, Kenneth, Dateraination of Pre-'Acqnisition Predtctore Book Use: ytnal 
Report (July 1973) 20 pp* (ERIC No* ED-'OSZ 776) 
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